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The property of locality in progrann behavior has been studied and nnodelled extensively 
because of its application to mennory design, code optimization, multiprogramming etc. We 
propose a k order Markov chain based scheme to model the sequence of time intervals 
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To design effective large-scale multiprocessors, designers need to understand the 
characteristics of the applications that will use the machines. Application characteristics of 
particular Interest Include the amount of communication relative to computation, the 
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Previous research has shown that the SPEC benchnnarks achieve low miss ratios in 
relatively small instruction caches. This paper presents evidence that current software- 
development practices produce applications that exhibit substantially higher instruction- 
cache miss ratios than do the SPEC benchmarks. To represent these trends, we have 
assembled a collection of applications, called the Instruction Benchmark Suite (IBS), that 
provides a better test of instruction-cache performance. We discuss th ... 
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In this paper, we present a new design-space exploration algorithm, the architecture 
explorer (AE), for analyzing performance/cost tradeoffs in memory-intensive applications. 
AE evaluates FU, bus, and memory cost for a series of performance constraints to produce 
a performance/cost tradeoff curve. Unlike previous approaches, AE handles both 
hierarchical and non-hierarchical memory architectures with various speeds of memory. 
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This paper discusses implementations of fine-grain memory access control, which 
selectively restricts reads and writes to cache-block-sized memory regions. Fine-grain 
access control forms the basis of efficient cache-coherent shared memory. This paper 
focuses on low-cost implementations that require little or no additional hardware. These 
techniques permit efficient implementation of shared memory on a wide range of parallel 
systems, thereby providing shared-memory codes with a portability ... 

MO B forms: a cl ass of mu l tilev el block alg o ri thms for dens e line ar a lgebr a opeMions 
Juan J. Navarro, Toni Juan, Tomas Lang 



http://portal.acm.org/resultsxfm?query==meniory%20hierarchy%201evels& 8/2/04 



Results (page 7): memory hierarchy levels 



Page 3 of 5 



July 1994 Proceedings of the 8th international conference on Supercomputing 

Full text available- pdf(1 00 MB) Additional Information: fu|J ciMQD, a bstra ct, re f er e nc e s, citings, in dex 
' ^ " t e r ms 

Multilevel block algorithms exploit the data locality in linear algebra operations when 
executed in machines with several levels in the memory hierarchy. It is shown that the 
family we call Multilevel Orthogonal Block (MOB) algorithms is optimal and easy to design 
and that using the multilevel approach produces significant performance improvements. 
The effect of interference in the cache, of the TLB misses, and of page faults are also 
considered. The multilevel block algorithms are evaluate ... 
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We describe an implementation of Level-3 BLAS (Basic Linear Algebra Subprograms) 
based on the use of the matrix-matrix multiplication kernel (GEMM). Blocking techniques 
are used to express the BLAS in terms of operations involving triangular blocks and calls to 
GEMM. A principal advantage of this approach is that most manufacturers provide at least 
an efficient serial version of GEMM so that our implementation can capture a significant 
percentage of the computer performance. A parameter whi ... 
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The performance of two-level on-chip caching is investigated for a range of technology and 
architecture assumptions. The area and access time of each level of cache is modeled in 
detail. The results indicate that for most workloads, two-level cache configurations (with a 
set-associative second level) perform marginally better than single-level cache 
configurations that require the same chip area once the first-level cache sizes are 64KB or 
larger. Two-level configurations become even more import ... 
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Large-scale shared memory multiprocessors typically support a multilevel memory 
hierarchy consisting of per-processor caches, a local portion of shared memory, and 
remote shared memory. On such machines, the performance of parallel programs is often 
limited by the high latency of remote memory references. In this paper we explore how 
knowledge of the underlying memory hierarchy can be used to schedule computation and 
distribute data structures, and thereby Improve data locality. Our study i ... 
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This paper investigates issues involving writes and caches. First, tradeoffs on writes that 
miss in the cache are investigated. In particular, whether the missed cache block is 
fetched on a write miss, whether the missed cache block is allocated in the cache, and 
whether the cache line is written before hit or miss is known are considered. Depending on 
the combination of these polices chosen, the entire cache miss rate can vary by a factor of 
two on some applications. The combination of no- ... 
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The use of non-volatile semiconductor memory within an extended storage hierarchy 
promises significant performance improvements for transaction processing. Although 
page-addressable semiconductor memories like extended memory, solid-state disks and 
disk caches are commercially available since several years, no detailed investigation of 
their use for transaction processing has been performed so far. We present a 
comprehensive simulation study that compares the performance of these storage ty ... 

Results 121 - 140 of 200 Result page: previous 12345678910 next 

The ACM Portal is published by the Association for Computing Machinery. Copyright © 2004 ACM, Inc. 
T erms of Usa ge Pnyacy .Policy Code of Ethics Con tact Us 

Useful downloads: Adobe Acrobat @ QuickTime B Windows Media Player ^ Real Plaver 



http://portal.acm.org/resultsxfin?query=memory%20hierarchy%201e 8/2/04 



